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Mitochondrial and 
Y-chromosomal profile of the 
Kazakh population from East 
Kazakhstan 



Aim To study the genetic relationship of Kazakhs from East 
Kazakhstan to other Eurasian populations by examining 
paternal and maternal DNA lineages. 

Methods Whole blood samples were collected in 2010 
from 160 unrelated healthy Kazakhs residing in East Ka- 
zakhstan. Genomic DNA was extracted with Wizard* ge- 
nomic DNA Purification Kit. Nucleotide sequence of hy- 
pervariable segment I of mitochondrial DNA (mtDNA) was 
determined and analyzed. Seventeen Y-short tandem re- 
peat (STR) loci were studied in 67 samples with the Amp- 
FiSTRY-filer PCR Amplification Kit. In addition, mtDNA data 
for 2701 individuals and Y-STR data for 677 individuals were 
retrieved from the literature for comparison. 

Results There was a high degree of genetic differentiation 
on the level of mitochondrial DNA. The majority of mater- 
nal lineages belonged to haplogroups common in Central 
Asia. In contrast, Y-STR data showed very low genetic diver- 
sity, with the relative frequency of the predominant haplo- 
type of 0.61 2. 

Conclusion The results revealed different migration pat- 
terns in the population sample, showing there had been 
more migration among women. mtDNA genetic diversity 
in this population was equivalent to that in other Central 
Asian populations. Genetic evidence suggests the exis- 
tence of a single paternal founder lineage in the popula- 
tion of East Kazakhstan, which is consistent with verbal ge- 
nealogical data of the local tribes. 
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In terms of population genetics, Central Asia is one of the 
least studied regions in the world. The studies conducted 
in the region, based on scarce genetic data, indicate that 
the Central Asia population is a mix of Eastern and Western 
populations (1,2). Kazakhstan is a vast country, which has 
throughout history been inhabited by different nomadic 
tribes such as the Argyn, Dughlat, Jalayir, Kerei, Kipchak, 
Madjar, Naiman, and others (3). The Kazakh ethnic group 
was formed in the 15th century under a huge infulence 
of the Mongol Empire (4). We expected the genetic profile 
of Kazakhs to be heterogeneous because of the different 
tribes and ethnicities (5). 

The current study focused on the Kazakh population of the 
East Kazakhstan Province, because recently there have been 
many reports on the neighboring populations of Xinjiang 
Uyghur Autonomous Region and Altai regions. East Kazakh- 
stan is populated by the Naiman tribe. Their genealogical 
narrative, "shezhire" states that the Naiman people living in 
Tarbagatay region are descendants of one ancestor, named 
Toktar-kozha, who came from the territory of modern Uz- 
bekistan and was a Sart by origin. Based on the data from 
"shezhire," we formed a hypothesis of uniform paternal de- 
scent of the Naiman tribe. The aim of this study was to bet- 
ter understand the origins and differentiation of the Kazakh 
ethnic group and to investigate the genetic relationship be- 
tween this population and other Eurasian populations. 




MATERIALS AND METHODS 
Participants and reference data 

A total of 160 blood samples were collected from healthy 
adult individuals, 67 men and 93 women, during an expe- 
dition to Tarbagatay region, East Kazakhstan in 2010. Prior 
to the expedition, ethical approval was received from the 
Ethics Committee of the National Center for Biotechnology 
of the Republic of Kazakhstan (No.10, 14.02.2010). The Eth- 
ics Committee approved the informed consent form and 
questionnaire form designed specifically for the study. 

The ethnic origin of sampled individuals was ascertained 
up to three generations. Blood was taken with the in- 
formed consent signed by all donors. In addition, all par- 
ticipants completed the questionnaire that included in- 
formation on the geographic origin, nationality, maternal 
and paternal pedigree, and health issues. Related individu- 
als were not included into sampling. In addition, mtDNA 
haplogroup data for 2701 individuals and Y-short tandem 
repeat (STR) haplotype data for 677 individuals of different 
ethnic backgrounds were retrieved from the literature to 
establish the genetic relationship of Kazakhs from East Ka- 
zakhstan with other populations (Supplementary Table 1 
and Supplementary Table 2) (Figure 1). 




FIGURE 1. (A) Map of Eurasia and locations of samples for the cited mitochondrial DNA studies. The following abbreviations were used to 
assign locations of population sampling: AK- Altaian Kazakhs (6), KZ1 - Kazakhs (1), KZ2 - Kazakhs (7), KR - Kirghiz (1), MN - Mongolians 
(6), Ul - Uighurs (7), AL - Altaians (8), KH - Khakassians (8), BU - Buryats (8), SO - Sojots (8), TD - Todjins (8), TU - Tuvinians (8), TO - Tofalars 
(8), MA - Mansi (9), KE - Ket (1 0), NG - Nganasan (9), TB - Tubalar (11), EV - Evenki (1 1), NE - Negedal (1 1), UL - Ulchi (1 1), Nl - Nivkhi (1 1), UD 
- Udegey (11), IT - Itelmen (12), KO - Koriak (12), CH - Chukchi (13), AN - Turks (14), Gl - Gilaki (14), Kl - Kurdish (14), LU - Lur (14), UZ - Uz- 
bek (14), TK- Turkmen (14), SH - Shugnan (14), TG - Kazakhs (this study). (B). Map of Eurasia and locations of samples for the cited Y- short 
tandem repeats (STR) studies The following abbreviations were used to assign locations of population sampling: AK1 - Altaian Kazakhs 
(15), AK2 - Altaian Kazakhs (15), KM - Kalmyks (16), KZ1 - Kazakhs (17), KZ2 - Kazakhs (18), KG - Kirghiz (17), MG1 - Mongolians (19), MG2 - 
Mongolians (19), TG -Tarbagatay (this study), Ul - Uighurs (17), UI2 - Uighurs (19), UZ - Uzbeks (18), KZ3 - Kazakhs from South Kazakhstan 
(accession number YA003729, www.yhrd.com) 
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Mitochondrial DNA analysis 

Blood collection was performed with EDTA-containing 
evacuated blood collection tubes (Terumo* Leuven, Bel- 
gium) and blood collecting needles (Terumo®). Blood sam- 
ples were stored in portable refrigerators until DNA was 
extracted. DNA extraction and purification was performed 
with Wizard 1 " genomic DNA Purification Kit (Promega, Madi- 
son, Wl, USA), according to the manufacturer's instructions. 

Polymerase chain reaction (PCR) amplification. Mitochon- 
drial DNA polymorphisms were typed for 160 individuals 
by using conventional PCR, with primers specific to hyper- 
variable segment I (HVS-I) (20). PCR reactions were carried 
out in 25-uL reaction volume, containing 3.2 pmol of each 
primer, 10 ng of genomic DNA, 0.2 units ofTaq pol enzyme 
(Lytech, Moscow, Russia), 200 uM of each dNTP, 1 xPCR 
buffer, and 2.5 mM MgCI 2 (Lytech). The PCR protocol was: 
94°Cfor 10 minutes, followed by 35 cycles of denaturation 
at 94°C for one minute, annealing at 55°C for one minute, 
extension at 72°C for one minute, and a final extension step 
at 72°C for 7 minutes. PCR products were initially checked 
in 1 .5% agarose gel and sequenced on ABI 3730x1 DNA an- 
alyzer (Applied Biosystems/Hitachi, Tokyo, Japan). 

Sequence alignment and haplogroup analysis. The ob- 
tained sequences were compared with the revised Cam- 
bridge Reference Sequence (rCRS, NC_012920) to find 
polymorphisms by using SeqScape v 2.6 software (Applied 
Biosystems, Foster City, CA, USA). Identified HVS-I polymor- 
phisms were used for assignment of mtDNA haplogroups. 
We also included published data on other Eurasian popu- 
lations for comparative analysis (Supplementary Table 1). 

Data analysis. Mitochondrial DNA haplogroup frequencies 
and haplogroup diversity values were calculated as de- 
scribed elsewhere (21). The haplogroups were combined 
into 15 groups to compare East Kazakhstan data with the 
published data sets. In case of American populations, only 
A, B, C, and D mtDNA haplogroup frequency data were 
used for comparison (Supplementary Table 3). Popula- 
tion pairwise genetic distances were calculated from hap- 
logroup frequencies of the Tarbagatay population and 32 
other Eurasian populations with Statistica v.10 software 
(StatSoft, Tulsa, OK, USA). 

Y chromosome analysis 

Cenotyping. Sixty-seven DNA samples were amplified with 
the AmpFLSTR Yfiler® PCR Amplification Kit (Applied Bio- 



systems, Warrington, UK) according to the manufactur- 
ers' instructions. It includes 17 Y-specific STR loci (DYS456, 
DYS389, DYS390 23, DYS389, DYS458, DYS19, DYS385 a 
and b, DYS393, DYS391, DYS439, DYS635, DYS392, DYS437, 
DYS438, DYS448, GATA 22) that were typed in three mul- 
tiplexes on an ABB 100 Genetic Analyzer (Applied Biosys- 
tems/Hitachi). A quality control check was performed by 
successful completing of the quality assessment exercise 
in 201 1 . After certification, the data were deposited to the 
Y-Chromosome Haplotype Reference Database (accession 
number YA003700). 

Data analyses. Fragment sizes were determined using the 
GeneScan 3.1.2 software (Applied Biosystems) and allele 
designations were performed using the Genotyper 2.5.2 
software (Applied Biosystems). We also included pub- 
lished data on Altaian Kazakhs, Kalmyks, Kazakhs, Kyrgyzs, 
Mongolians, Uighurs, and Uzbeks (Supplementary Table 
2). Haplotype diversity was calculated as described in the 
literature (22). Discrimination capacity was calculated as 
D = N diff /N, where N djff isthe number of different haplotypes 
of the population. Basic parameters of molecular diversity 
and population genetic structure, including Slatkin's Rst 
matrices for pairwise genetic distances were calculated 
using the software package Arlequin 3.5.1.2 (University of 
Bern, Bern, Switzerland). The statistical significance (P-val- 
ues) was estimated by permutation analysis, using 10100 
permutations. The STATISTICA package (StatSost, Tulsa, OK, 
USA) was used for multidimensional scaling (MDS) analysis. 
Y predictor by Vadim Urasin, v.1 .5.0 was used for Y- STR hap- 
logroup prediction (http://predictor.ydna.ru/). 

RESULTS 

The study of mtDNA haplogroup variation in the popula- 
tion of East Kazakhstan region revealed that the majority 
of maternal lineages were distributed among haplogroups 
that were common in Central Asian region (Table 1). 

Comparison of 33 Eurasian populations based on genet- 
ic distances was performed and the results are presented 
as a multi-dimensional scaling plot (Figure 2). The popu- 
lations genetically closest to the Tarbagatay population 
were Mongolians from Mongolia (MN), Altaian Kazakhs 
from South Siberia, Russia (AK), and Kazakhs from Xinjiang, 
China (KZ2). 

In fact, all of them had equal haplogroups A, B, C, D, F, G, 
H, and M, suggesting that these lineages were in the 
common maternal gene pool from which these dif- 
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ferent lineages had emerged. However, there were some 
notable differences between them. For example, Kazakhs 
from Xinjiang had higher frequencies of West Eurasian lin- 
eages H, T, U, and Z than Mongolians, Altaian Kazakhs, or 
Kazakhs from East Kazakhstan. Kazakhs from East Kazakh- 
stan had greater diversity of haplogroups present in their 
mtDNA gene pool than the groups from South Siberia, 
Xinjiang, or Mongolia. Since recent data showed a com- 
mon ancestry of indigenous Altaians with Native Ameri- 
cans (23), we compared East Kazakhstan population with 



the Native Amerindians. A genetic-distance analysis indi- 
cated divergence from the Native American populations 

(Supplementary Table 3). 

Analysis of 1 7 fast evolving Y-STRs provided additional de- 
tails that helped to elucidate the paternal diversity in the 
Kazakh population. A total of 26 different haplotypes were 
identified in 67 individuals. Twenty-four haplotypes were 
nonrecurring. Paternal genetic variation within the popu- 
lation was rather low. Relative frequency of haplotype 15- 



TABLE 1 . Mitochondrial DNA haplogroup frequencies in the selected populations of Eurasia* 





/\ 

(%) 


g 

(%) 


Q 

(%) 


rj 

(%) 


p 

(%) 


(%) 


H 

(%) 


j 

(%) 
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(%) 
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(%) 
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(%) 


j 

(%) 


u 

(%) 


1_ 

(%) 


Other 

(%> 


AK 


4.2 


9.3 


10.5 


17.6 


7.2 


9.2 


10.9 


3 


0.8 


5.4 


5.1 


1.7 


8 


0.4 


6.7 


KZ1 


9.1 


5.4 


7.3 


18.2 


3.6 


5.5 


21.9 


0 


0 


7.3 


0 


7.3 


5.4 


1.8 


7.2 


KZ2 


3.8 


3.8 


13.2 


13.2 


7.5 


3.8 


13.1 


1.9 


0 


9.5 


0 


7.5 


3.8 


11.3 


7.6 


MN 


3.4 


2.2 


18 


31.4 


6.7 


6.7 


5.6 


6.7 


3.4 


7.9 


0 


2.2 


1.1 


3.4 


1.3 


KR 


4.2 


6.4 


12.6 


20 


2.1 


8.4 


21 


5.3 


0 


6.3 


0 


1.1 


4.3 


1.1 


7.2 


YU 


7.3 


7.3 


1.8 


16.4 


7.3 


1.8 


10.9 


0 


3.6 


10.9 


0 


1.8 


16.4 


0 


14.5 


AL 


0 


3.6 


19.1 


15.5 


9.1 


1.8 


6.4 


3.6 


0 


7.3 


7.2 


0.9 


16.4 


4.6 


4.5 


KH 


3.8 


3.8 


35.8 


13.2 


22.6 


0 


3.8 


1.9 


0 


0 


1.9 


1.9 


11.3 


0 


0 


BU 


2.2 


6.6 


28.5 


33 


1.1 


14.3 


2.2 


2.2 


0 


3.3 


0 


1.1 


1.1 


1.1 


3.3 


SO 


10 


3.3 


20 


46.8 


0 


6.7 


0 


0 


3.3 


0 


0 


0 


3.3 


0 


6.6 


TD 


4.2 


4.2 


47.6 


4.2 


2.1 


18.8 


2.1 


0 


0 


4.2 


0 


0 


6.3 


0 


6.3 


TU 


1.1 


7.8 


47.9 


17.8 


2.2 


6.6 


1.1 


5.6 


0 


0 


1.1 


1.1 


3.3 


1.1 


3.3 


TO 


5.2 


3.5 


62 


0 


0 


1.7 


6.9 


8.6 


0 


0 


0 


5.2 


0 


5.2 


1.7 


MA 


3.1 


0 


17.3 


8.3 


1 


6.1 


14.3 


12.2 


3.1 


1 


0 


7.2 


25.4 


0 


1 


KE 


7.9 


0 


15.8 


2.6 


23.7 


0 


10.5 


0 


0 


0 


0 


0 


34.2 


2.6 


2.7 


NG 


0 


0 


33.3 


29.2 


0 


0 


8.4 


0 


0 


0 


0 


0 


24.9 


4.2 


0 


TB 


11.1 


4.2 


19.4 


19.5 


1.4 


0 


5.6 


0 


0 


0 


6.9 


0 


26.3 


1.4 


4.2 


EV 


5.6 


0 


71.9 


21.1 


1.4 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


NE 


0 


12.1 


15.2 


24.3 


0 


27.2 


0 


0 


0 


0 


0 


0 


0 


0 


21.2 


UL 


0 


0 


13.7 


21.9 


1.1 


11.5 


0 


0 


0 


2.3 


6.9 


0 


0 


0 


42.6 


Nl 


0 


0 


0 


28.6 


0 


5.4 


0 


0 


0 


0 


0 


0 


0 


0 


66 


UD 


0 


0 


17.4 


0 


0 


0 


0 


0 


0 


28.3 


30.4 


0 


0 


0 


23.9 


IT 


6.4 


0 


14.9 


0 


0 


68.2 


0 


0 


0 


0 


0 


0 


0 


6.4 


4.1 


CH 


68.2 


0 


10.6 


12.1 


0 


9.1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


AN 


4 


0 


0 


2 


0 


2 


36 


8 


6 


0 


2 


8 


24 


0 


8 


Gl 


0 


0 


0 


2.7 


0 


0 


37.8 


16.3 


2.7 


2.7 


2.7 


16.2 


18.9 


0 


0 


Kl 


0 


0 


0 


0 


0 


0 


30 


10 


10 


0 


0 


0 


35 


0 


15 


LU 


0 


5.9 


0 


0 


0 


0 


41.1 


5.9 


5.9 


0 


0 


5.9 


35.3 


0 


0 


uz 


7.1 


0 


2.4 


9.4 


2.4 


2.4 


28.6 


7.1 


0 


11.9 


7.1 


4.8 


12 


0 


4.8 


TK 


2.4 


2.4 


7.1 


22 


2.4 


0 


32.1 


9.8 


0 


4.9 


2.4 


7.3 


4.8 


0 


2.4 


SH 


2.3 


0 


18.2 


0 


0 


0 


34 


4.5 


9.2 


0 


0 


2.3 


20.4 


0 


9.1 


KO 


5.2 


0 


36.1 


1.3 


0 


41.9 


0 


0 


0 


0 


0 


0 


0 


5.8 


9.7 


TG 


8.2 


8.8 


8.8 


34 


4.2 


3.4 


9.5 


1.4 


0.7 


5.4 


2 


2.7 


6.8 


1.4 


2.7 



•Abbreviations: AK - Altaian Kazakhs (6), KZ1 - Kazakhs (1), KZ2 - Kazakhs (7), KR - Kirghiz (1), MN - Mongolians (6), Ul - Uighurs (7), AL - Altaians (8), 
KH - Khakassians (8), BU - Buryats (8), SO - Sojots (8), TD - Todjins (8), TU - Tuvinians (8), TO - Tofalars (8), MA - Mansi (9), KE - Ket (10), NG - Ngana- 
san(9), TB-Tubalar(ll), EV- Evenki (11), NE - Negedal (11), UL - Ulchi (11), Nl - Nivkhi (11), UD - Udegey (11), IT-ltelmen (12), KO - Koriak (12), CH 
- Chukchi (13), AN - Turks (14), Gl - Gilaki (14), Kl - Kurdish (14), LU - Lur (14), UZ - Uzbek (14), TK - Turkmen (14), SH - Shugnan (14), TG - Kazakhs (this 
study). 
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1 2-29-23-1 0-1 3-1 2-1 3,1 8-1 0-1 2-15-1 9-1 5-1 7-19-1 2 (DYS 19, DYS458, DYS635, GATA H4) was 0.61 2 (Table 2). The hap- 
DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393, lotype with an identical set of alleles was also found in the 
DYS385a,b, DYS438, DYS439, DYS437, DYS448, DYS456, population of southeastern Altaian Kazakhs, with the rela- 



TABLE 2. 17-locus Y-short tandem repeat (STR) haplotypes of 67 Kazakhs from East Kazakhstan 





nvc; nvc; 
19 3891 


nvc: 
389II 


nvc; 
390 


nvc; 

Ul J 

391 


DYS DYS DYS 
392 393 385 


nvc; 

Ul J 

438 


nvc; 

Ul J 

439 


Ul j 

437 


DYS DYS 
448 456 


nvc; 

Ul J 

458 


nvc; 

Ul J 

635 


VIRATA 

H4 


1-41 


15 12 


29 


23 


10 


13 




12 


13,18 


10 


12 


15 


19 


15 


17 


19 


12 


42-43 


15 12 


29 


22 


10 


10 




11 


13,18 


10 


10 


14 


22 


15 


16 


21 


11 


44 


15 15 


31 


25 


10 


11 




13 


13,18 


10 


11 


14 


20 


15 


17 


24 


10 


45 


15 12 


29 


23 


10 


13 




12 


13,19 


10 


12 


15 


19 


15 


16 


19 


12 


46 


16 14 


31 


24 




11 




13 


12,16 


10 


12 


14 


20 


16 


17 


23 


10 


47 


15 13 


29 


24 


10 


9 




13 


12,14 


11 


11 


14 


20 


17 


18 


21 


11 


48 


15 12 


29 


24 


10 


13 




13 


13,18 


10 


13 


14 


23 


15 


18 


22 


11 


49 


17 14 


32 


23 


11 


11 




13 


11,14 


11 


13 


14 


20 


16 


16 


23 


11 


50 


14 14 


31 


23 


11 


14 




14 


11,13 


14 


10 


14 


19 


13 


17 


23 


11 


51 


14 12 


30 


23 


11 


12 




10 


11,14 


12 


12 


15 


19 


16 


14 


24 


12 


52 


14 12 


30 


23 


11 


12 




9 


11,14 


15 


12 


15 


19 


13 


14 


24 


10 


53 


BHD 


29 


22 




12 




10 


13,18 


10 


11 


15 


19 


15 


15 


20 


11 


54 




29 


25 


10 


11 




13 


12,13 


12 


10 


14 


22 


15 


18 


21 


11 


55 


16 13 


30 


25 


11 


11 




13 


11,14 


11 


10 


14 


22 


16 


15 


20 


12 


56 


15 12 


29 


23 


10 


13 




8 


13,17 


10 


12 


15 


19 


15 


17 


19 


12 


57 


15 12 


31 


21 


10 


10 




15 


11,13 


10 


10 


14 


21 


15 


14 


21 


10 


58 


15 11 


29 


22 


9 


12 




11 


13,18 


10 


11 


15 


19 


15 


15 


20 


12 


59 


15 12 


28 


22 


9 


12 




10 


12,16 


10 


10 


14 


20 


18 


15 


20 


11 


60 


15 11 


29 


22 


9 


12 




10 


13,17 


10 


12 


15 


19 


15 


15 


20 


12 


61 


B^HH 


29 


22 


10 


12 




10 


11,13 


10 


11 


15 


19 


15 


15 


20 


12 


62 


15 12 


29 


23 


9 


10 




12 


13,17 


11 


10 


15 


19 


15 


16 


22 


11 


63 


Hl^H 


28 


23 


10 


13 




14 


13,18 


10 


12 


15 


19 


15 


17 


19 


12 


64 


15 14 


29 


23 


10 


7 




13 


13,18 


10 


13 


14 


18 


15 


18 


20 


11 


65 


16 13 


28 


26 


10 


11 




13 


13,18 




10 


14 


23 


15 


15 


20 


11 


66 


15 12 


29 


23 


10 


13 




12 


13,19 


10 


12 


15 


19 


15 


18 


19 


12 


67 


15 11 


31 


25 


12 


13 




11 


13,18 


12 


12 


17 


23 


15 


18 


26 


11 


TABLE 3. Rst value matrix of Central Asian populations* 




TG 


KM 


AK1 


AK2 




KZ1 




KG 


UI1 


KZ2 


UZ 




UI2 


MG1 


MG2 


KZ3 


TG 




C0.001 


<0.001 


<0.001 


<0.001 




<0.001 


<0.001 


<0.001 


<0.001 


<0.001 


<0.001 


<0.001 


<0.001 


KM 


0.390 




0.120 


<0.001 


<0.001 




<0.001 


<0.001 


<0.001 


<0.001 


<0.001 


<0.001 


<0.001 


<0.001 


AK1 


0.281 


0.019 




0.006 


<0.001 




<0.001 


<0.001 


0.382 


0.009 


<0.001 


<0.001 


<0.001 


0.020 


AK2 


0.259 


0.123 


0.091 




<0.001 




<0.001 


<0.001 


<0.001 


<0.001 


<0.001 


<0.001 


<0.001 


0.001 


KZ1 


0.618 


0.654 


0.676 


0.662 






0.031 


<0.001 


<0.001 


<0.001 


<0.001 


<0.001 


<0.001 


0.000 


KG 


0.467 


0.553 


0.516 


0.518 




0.036 




0.002 


<0.001 


<0.001 




0.007 


0.001 


<0.001 


<0.001 


UI1 


0.246 


0.428 


0.307 


0.369 




0.110 


0.075 




<0.001 


<0.001 




0.866 


0.035 


0.246 


<0.001 


KZ2 


0.070 


0.028 


-0.003 


0.041 




0.106 


0.080 


0.098 




0.013 


<0.001 


<0.001 


<0.001 


<0.001 


UZ 


0.221 


0.151 


0.085 


0.103 




0.610 


0.437 


0.284 


0.013 




<0.001 


<0.001 


<0.001 


<0.001 


UI2 


0.358 


0.514 


0.432 


0.478 




0.12E 




0.072 


-0.021 


0.080 


0.385 






0.083 


0.870 


<0.001 


MG1 


0.446 


0.584 


0.544 


0.553 




0.174 


0.101 


0.036 


0.103 


0.450 




0.029 




0.106 


<0.001 


MG2 


0.442 


0.572 


0.549 


0.538 




0.151 




0.095 


0.007 


0.124 


0.475 




0.016 


0.018 




<0.001 


KZ3 


0.357 


0.112 


0.063 


0.068 




0.674 


0.538 


0.423 


0.031 


0.128 




0.529 


0.601 


0.585 





*Rst values are located in the lower part of the matrix. Fst P-values were calculated from 10100 permutations and are located in the upper part 
of the matrix. TG - East Kazakhstan (this study), KM - Kalmyks (16), AK1 - Altaian Kazakhs (15), AK2 - Altaian Kazakhs (15), KZ1 - Kazakhs (17), KG - 
Kirghiz (17), Ul - Uighurs (17), KZ2 - Kazakhs (18), UZ - Uzbeks (18), MG1 - Mongolians (19), MG2 - Mongolians (19) UI2 - Uighurs (19), KZ3 - Kazakhs 
(YA003729, www.yhrd.com). 
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tive frequency of 0.214 (15). The haplotype diversity of 17 
STR loci was 0.629 ±0.071 .The overall discrimination capac- 
ity was 0.388. We also reduced the 1 7-STR profile to a 5-STR 
profile (DYS389I, DYS390, DYS391, DYS392, and DYS393) to 



compare our population data with the published data sets. 
As a result of this reduction, 26 Altaian Kazakh haplotypes 
collapsed into 22 unique haplotypes, and the number of 
shared haplotypes increased. 



MN AK KZ2'? KZ1 



1 4 -1 2 -1 0 -0.8 -0 6 -0.4 -0.2 0 0 0 2 0 4 0 6 0 8 1 0 1 2 1 4 1 6 
Dimension 1 



FIGURE 2. Multi-dimensional scaling plot of pair differences 
based on mitochondrial DNA haplogroup frequencies in 
Eurasian populations. The following abbreviations were used to 
assign populations: AK - Altaian Kazakhs (6), KZ1 - Kazakhs (1), 
KZ2 - Kazakhs (7), KR - Kirghiz (1), MN - Mongolians (6), Ul - Ui- 
ghurs (7), AL - Altaians (8), KH - Khakassians (8), BU - Buryats (8), 
SO - Sojots (8), TD - Todjins (8), TU - Tuvinians (8), TO - Tofalars 
(8), MA - Mansi (9), KE - Ket (10), NG - Nganasan (9), TB - Tubalar 
(1 1), EV - Evenki (1 1), NE - Negedal (11), UL- Ulchi (11), Nl - Nivkhi 
(11), UD - Udegey (11), IT - Itelmen (12), KO - Koriak (12), CH - 
Chukchi (13), AN - Turks (14), Gl - Gilaki (14), Kl - Kurdish (14), LU 
- Lur (14), UZ - Uzbek (14), TK - Turkmen (14), SH - Shugnan (14), 
TG - Kazakhs (this study). 
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FIGURE 3. Multi-dimensional scaling plot of Rst distances based 
on Y-short tandem repeats (STR) haplotypes in Eurasian popula- 
tions. 



Rst values showed that population of East Kazakhstan re- 
mained distinctive even with a reduced number of Y-STRs. 
Based on the Slatkin's Rst matrix, the populations geneti- 
cally closest to the Tarbagatay population were Kazakhs 
from Kara-Kalpakia (Kazakhs2), southeastern Altaian Ka- 
zakhs (Altaian Kazakhs2), Uighurs from South East Kazakh- 
stan (Uighursl), and Uzbeks from Kara-Kalpakia (Uzbeks) 
(Table 3). Our population showed large genetic distances 
from Kyrgyzs and Kalmyks. MDS plot showed that the stud- 
ied population was genetically closest to population of Uz- 
beks from Uzbekistan (Kara-Kalpakia) (Figure 3). 

DISCUSSION 

This study found a high degree of genetic differentiation 
on the level of mitochondrial DNA, but very low genetic 
diversity ofY-STRdata. 

Genetically close subpopulations of Kazakhs from Altai, 
Kazakhstan, and Xinjiang showed a similar mtDNA com- 
position consisting of mainly East Eurasian haplogroups. 
Affinities among these populations may result from their 
common origin or a recent admixture resulting from geo- 
graphic proximity. Genetic distances between populations 
can be related to geographic distances, according to a 
model of isolation by distance (22). Comparably high fre- 
quencies of European lineages are consistent with the in- 
termediate position of the East Kazakhstan region in Eur- 
asia but some inconsistent features were also present in the 
distribution of frequencies in mtDNA lineages. For example, 
relatively high frequencies of East Asian haplogroups (D, C, 
F and G) in this population imply significant contribution 
from Siberian and Central Asian populations. These differ- 
ences indicate that the population history of the studied 
group of Kazakhs was different from other groups. 

The paternal lineages present in theTarbagatay population 
were different from the populations of Kazakhs that have 
been studied before, with an exception of southeastern Al- 
taian Kazakhs, who had 19 haplotypes (21.35% out of the 
sample size) identical with the most common haplotype 
observed in East Kazakhstan. This could be explained by 
historical data indicating that Kazakhs began migrating to 
the Altai region in the 1 9th century. According to Oktyabr- 
skaya (24), there were 1777 people in the Kazakh Altai in 
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1880, 70% of them belonging to the genus Naiman and 
20% to the genus Kerey. 

Our results indicate that the studied subpopulation of Ka- 
zakhs has low paternal genetic diversity, and share the 
common paternal ancestry. The existence of a common 
ancestor is supported by the predominance of a single 
haplotype in the population. It is also supported by "she- 
zhire" which states that the Naiman tribe has a common 
single ancestor from Uzbekistan. These verbal genealogi- 
cal data are consistent with the results of the MDS plot. An 
alternative explanation for our results is provided by histo- 
rian Sultanov (25), who suggested that 240-360 thousand 
of nomads, including Naimans, migrated to the territory of 
modern Uzbekistan in the beginning of the 16th century. 
Further population studies are required to compare the 
genetic profiles of Naimans from Central and South-East- 
ern regions of Kazakhstan. In addition, the Naimans also 
inhabit the territory of Mongolia (Bayan-Olgii province), 
Russia (Altai Republic), and China (Xinjiang Uygur Autono- 
mous Region). According to unofficial estimates, the popu- 
lation of Naimans in 1917 was over 800 thousand people. 
Unfortunately, population studies of the Kazakh tribes are 
currently not being conducted. 

In addition, we genotyped 99 male individuals from South 
Kazakhstan region populated by a different tribe (accession 
number YA003729, www.yhrd.com). Not a single 1 7 Y-STR 
haplotype was found to be common between South and 
East Kazakhstan region. This implies different paternal origin 
of the tribes and genetic substructuring among Kazakhs. 

The comparison with the central "star cluster" profile, de- 
scribed by Zerjal et al (4), showed that only one haplotype 
from East Kazakhstan can possibly be assigned to the "ge- 
netic legacy of the Mongols." It may be concluded that 
the influence of Genghis Khan's Y-chromosomal lineage 
was insignificant in spite of the two centuries long rule 
of Genghis Khan and his descendants over the Naimans 
(mid-1 3th to mid-1 5th century) (4,26-28). 

The studied subpopulation represents a genetically iso- 
lated group with a single paternal founder lineage differ- 
ent from the "star cluster"lineage.The presented results re- 
veal different migration patterns in East Kazakhstan region, 
showing more migration among women. Partially, this pa- 
ternal genetic uniformity could be explained by local tra- 
ditions, such as exogamy, that were strictly followed in the 
past and played a crucial role in conservation of the unique 
genetic properties. The population of Madjar from Torgay 



area had been following same traditions in the past aim- 
ing to avoid inbreeding (3). Nonetheless, mtDNA genetic 
diversity in this population is equivalent to that in other 
Central Asian populations. 

Our finding on active migration of maternal DNA requires 
additional explanation and further support from comple- 
mentary sources, especially since the studied nomadic 
tribe is scarcely described in historical sources. Apart from 
additional genotyping of the Kazakh population, further 
cultural and historical studies are needed to gain more 
knowledge on the cultural processes that have greatly in- 
fluenced genetic variability of this and other populations. 

Funding received from the Ministry of Education and Science of the Repub- 
lic of Kazakhstan, grant No. 1.04.01. This study was also partially funded by 
Russian Foundation for Basic Research, grant 1 2-04-9091 5. 

Ethical approval received from the Ethics Committee of the National Cen- 
ter for Biotechnology of the Republic of Kazakhstan, Astana, Kazakhstan 
(No.10, 14.02.2010). 

Declaration of authorship PVTand EVZ contributed to study design, sam- 
ple collection, lab work, and manuscript preparation. ARA contributed to 
sample collection. ZMN contributed to sample collection and lab work. 
ZMS contributed to manuscript preparation. TKR contributed to sample 
collection and study design. EMR contributed to study design. 

Competing interests All authors have completed the Unified Competing 
Interest form at www.icmje.org/coi_disclosure.pdf (available on request 
from the corresponding author) and declare: no support from any organi- 
zation for the submitted work; no financial relationships with any organiza- 
tions that might have an interest in the submitted work in the previous 3 
years; no other relationships or activities that could appear to have influ- 
enced the submitted work. 

References 

1 Comas D, Calafell F, Mateu E, Perez-Lezaun A, Bosch E, Martinez- 
Arias R, et al. Trading genes along the silk road: mtDNA sequences 
and the origin of central Asian populations. Am J Hum Genet. 
1998;63:1824-38. Medline:9837835doi:10.1086/302133 

2 Lalueza-Fox C, Sampietro ML, Gilbert MT, Castri L, Facchini 
F, Pettener D, et al. Unravelling migrations in the steppe: 
mitochondrial DNA sequences from ancient central Asians. 
Proc Biol Sci. 2004,271 :941 -7. Medline:1 5255049 doi:1 0.1 098/ 
rspb.2004.2698 

3 Biro AZ, Zalan A, Volgyi A, Pamjav H. A Y-chromosomal comparison 
of the Madjars (Kazakhstan) and the Magyars (Hungary). Am J 
Phys Anthropol. 2009;1 39:305-1 0. Medline:1 91 70200 doi:1 0.1 002/ 
ajpa.20984 

4 Zerjal T, Xue Y, Bertorelle G, Wells RS, Bao W, Zhu S, et al.The 
genetic legacy of the Mongols. Am J Hum Genet. 2003;72:717-21. 
Medline:1 2592608 doi:1 0.1 086/367774 

5 Olcott MB. The Kazakhs. 2nd ed. Stanford (CA): Hoover Institution 
Press, Stanford University Press; 1995. 

6 Gokcumen O, Dulik MC, Pai AA, Zhadanov SI, Rubinstein S, 
Osipova LP, et al. Genetic variation in the enigmatic Altaian 
Kazakhs of South-Central Russia: insights into Turkic 



www.cmj.hr 



POPULATION GENETICS 



Croat Med J. 2013;54:17-24 



population history. Am J Phys Anthropol. 2008;1 36:278-93. 
Medline:1 832291 5 doi:l 0.1 002/ajpa.20802 

7 Yao YG, Kong QP, Wang CY, Zhu CL, Zhang YP. Different matrilineal 
contributions to genetic structure of ethnic groups in the silk road 
region in china. Mol Biol Evol. 2004;21:2265-80. Medline:1 531 7881 
doi:1 0.1 093/molbev/msh238 

8 Derenko MV, GrzybowskiT, Malyarchuk BA, Dambueva IK, 
Denisova GA, Czarny J, et al. Diversity of mitochondrial DNA 
lineages in South Siberia. Ann Hum Genet. 2003;67:391-41 1 . 
Medline:1 294091 4 doi:1 0.1 046/j.l 469-1 809.2003.00035.x 

9 Derbeneva OA, Starikovskaya EB, Wallace DC, Sukernik Rl. Traces 
of early Eurasians in the Mansi of northwest Siberia revealed by 
mitochondrial DNA analysis. Am J Hum Genet. 2002,70:1 009-1 4. 
Medline:1 1 845409 doi:1 0.1 086/339524 

10 Derbeneva OA, Starikovskaia EB, Volod'ko NV, Wallace DC, 
Sukernik Rl. Mitochondrial DNA variation in Kets and Nganasans 
and the early peoples of Northern Eurasia [in Russian]. Genetika. 
2002;38:1 554-60. Medline:1 2500682 

1 1 Starikovskaya EB, Sukernik Rl, Derbeneva OA, Volodko NV, Ruiz- 
Pesini E, Torroni A, et al. Mitochondrial DNA diversity in indigenous 
populations of the southern extent of Siberia, and the origins of 
Native American haplogroups. Ann Hum Genet. 2005;69:67-89. 
Medline:1 5638829 doi:1 0.1 046/j.l 529-881 7.2003.00127.x 

1 2 Schurr TG, Sukernik Rl, Starikovskaya YB, Wallace DC. Mitochondrial 
DNA variation in Koryaks and Itel'men: population replacement 

in the Okhotsk Sea-Bering Sea region during the Neolithic. Am 
J Phys Anthropol. 1999;108:1-39. Medline:9915299 doi:10.1002/ 
(SICI) 1 096-8644(1 99901 ) 1 08:1 <1 ::AID-AJPA1 >3.0.CO;2-1 

1 3 Starikovskaya YB, Sukernik Rl, Schurr TG, Kogelnik AM, Wallace DC. 
mtDNA diversity in Chukchi and Siberian Eskimos: implications for 
the genetic history of Ancient Beringia and the peopling of the 
New World. Am J Hum Genet. 1 998;63:1 473-91 . Medline:9792876 
doi:1 0.1 086/302087 

1 4 Quintana-Murci L, Chaix R, Wells RS, Behar DM, Sayar H, Scozzari 
R, et al. Where west meets east: the complex mtDNA landscape 
of the southwest and Central Asian corridor. Am J Hum Genet. 
2004;74:827-45. Medline:1 5077202 doi:1 0.1 086/383236 

15 DulikMC, Osipova LP, Schurr TG.Y-chromosome variation in Altaian 
kazakhs reveals a common paternal gene pool for Kazakhs and 
the influence of mongolian expansions. PLoS ONE. 201 1 ;6:e1 7548. 
Medline:21412412doi:10.1371/journal.pone.0017548 

16 Nasidze I, Quinque D, Dupanloup I, Cordaux R, Kokshunova 
L, Stoneking M. Genetic evidence for the Mongolian 
ancestry of Kalmyks. Am J Phys Anthropol. 2005;1 28:846-54. 
Medline:! 6028228 doi:1 0.1 002/ajpa.201 59 



17 Perez-Lezaun A, Calafell F, Comas D, Mateu E, Bosch E, Martinez- 
Arias R, et al. Sex-specific migration patterns in Central Asian 
populations, revealed by analysis of Y-chromosome short 
tandem repeats and mtDNA. Am J Hum Genet. 1 999;65:208-1 9. 
Medline:1 0364534 doi:1 0.1 086/302451 

18 Chaix R, Austerlitz F, KhegayT, Jacquesson S, Hammer MF, Heyer E, 
et al. The genetic or mythical ancestry of descent groups: lessons 
from the Y chromosome. Am J Hum Genet. 2004;75:1 1 1 3-6. 
Medline:1 5467979 doi:1 0.1 086/425938 

1 9 Xue Y, Zerjal T, Bao W, Zhu S, Shu Q, Xu J, et al. Male demography in 
East Asia: a north-south contrast in human population expansion 
times. Genetics. 2006;172:2431-9. Medline:16489223 doi:10.1534/ 
genetics.1 05.054270 

20 Vigilant L, Pennington R, Harpending H, KocherTD, Wilson AC. 
Mitochondrial DNA sequences in single hairs from a southern 
African population. Proc Natl Acad Sci USA. 1989;86:9350-4. 
Medline:2594772 doi:l 0.1 073/pnas.86.23.9350 

21 Nei M. Analysis of gene diversity in subdivided populations. Proc 
Natl Acad Sci U S A. 1 973,70:3321-3. Medline:451 9626 doi:1 0.1 073/ 
pnas.70.12.3321 

22 Cavalli-Sforza LL, Feldman MW. Spatial subdivision of populations 
and estimates of genetic variation. Theor Popul Biol. 1990;37:3-25. 
Medline:2326767 doi:1 0.1 01 6/0040-5809(90)90024-P 

23 Dulik MC, Zhadanov SI, Osipova LP, Askapuli A, Gau L, Gokcumen 
O, et al. Mitochondrial DNA and Y chromosome variation provides 
evidence for a recent common ancestry between Native Americans 
and indigenous Altaians. Am J Hum Genet. 2012;90:229-46. 
Medline:22281 367 doi:1 0.1 016/j.ajhg.201 1 .1 2.014 

24 Oktyabrskaya VI. The Kazakhs of the Altai: ethnopolitical and 
sociocultural processes in the border areas of southern Siberia in 
XIX-XX centuries, dissertation [in Russian]. Novosibirsk (Russia); 
2004. 

25 SultanovTl.The nomadic tribes of the Aral Sea in XV-XVII centuries, 
[in Russian]. Moscow: Nauka; 1982. 

26 Klyashtorny SG, Savinov DG.The steppe empires in ancient Eurasia 
[in Russian]. Saint Petersburg (Russia): Faculty of Literacy of the St.- 
Petersburg State University; 2005. 

27 Akhmedov BA.The Uzbek Khanate [in Russian]. Moscow: Nauka; 
1965. 

28 Soucek S. A history of inner Asia. Cambridge (UK): Cambridge 
University Press; 2000. 



www.cmj.hr 



